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A  Thunderstorm  Prediction  Technique 
Based  on  a  Perfect-Prog  Approach 


1.  INTRODUCTION 

A  thunderstorm  prediction  technique  has  been  developed  using  synoptic  and  subsynoptic-scale 
data  for  diagnosis  and  short-range  probability  forecasts  of  any  storm  (nonsevere  and  severe)  and 
severe  storm  occurrence  over  the  continental  United  States  during  May,  July,  and  September  of 
1992  and  1993.  The  goal  was  to  adopt  a  methodology  different  from  the  currently  operational 
Model  Output  Statistics  (MOS)  approach  used  by  the  National  Weather  Service  (NWS)/National 
Center  for  Environmental  Prediction  (Reap,  1994)1  due  to  the  need  to  address  global 
requirements  where  real-time  or  near  real-time  NEXRAD  data,  for  example,  was  not  available. 
Our  approach  limits  the  number  of  predictors  considered  (in  the  same  vein  as  Albers,  1987)2; 
however,  it  does  not  limit  the  number  to  include  only  commonly-used  storm-potential  indices  such 
as  the  K  Index,  Severe  Weather  Threat  Index  (SWEAT),  Showalter  Index,  etc.  (Knapp,  1995)3. 
We  decided  against  deriving  predictors  other  than  those  produced  by  SHARP  (NWS’  SKEW- 
T/Hodograph  Analysis  and  Research  Program),  due  to  the  time  involved  in  using  and  analyzing 
predictors  for  the  12  CONUS  regions  and  three  predictand  types. 

Prediction  equations  would  be  used  to  satisfy  Air  Force  Weather’s  Theater  Battle 
Management  (TBM)  stated  requirements  for  maximum  cloud  tops  and  coverage  and  implemented 
in  the  Global  Theater/Weather  Analysis  and  Prediction  System  (GTWAPS).  Theater  forecast 
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requirements  extend  from  0  to  6,  12,  24,  and  36  hours;  hemispheric  requirements  include  theater 
and  also  48  and  72  hours.  GTWAPS  is  currently  planned  to  be  fully  operational  after  the  year 
2000.  The  thunderstorm  prediction  technique  described  herein  was  thus  tailored  for  use  with  a 
mesoscale  model,  but  not  for  a  specific  model,  and  so  the  MOS  approach  was  abandoned  early  in 
the  project  for  a  perfect-prog  approach.  Other  TBM  requirements-for  clouds,  turbulence,  icing, 
surface  visibility,  and  present  weather-were  addressed  by  other  researchers,  in  the  same  Branch  at 
Air  Force  Research  Laboratory,  to  maximize  collaboration  and  ensure  consistency  and  agreement 
among  the  output  of  the  various  developmental  algorithms. 

Follow-on  testing  using  MM5  modeled  forecast  data  from  the  surface  up  to  100  mb  for 
August,  1993,  is  described  later  in  this  report. 

A  stepwise  linear  regression  technique  has  been  used  on  the  most  promising  predictors  for 
each  of  12  CONUS  regions  and  the  overall  CONUS.  Correlation  coefficients  were  first 
calculated  and  analyzed  to  identify  the  most  promising  predictors  from  a  set  of  121  predictors. 
Table  1  identifies  these  predictors,  which  were  generated  from  SHARP  (Hart  and  Korotky)4. 

2.  PREDICTORS 

Extensive  as  the  initial  pool  of  predictors  was,  it  did  not  contain  those  dynamic  parameters 
such  as  vertical  velocity,  differential  vorticity  advection,  and  warm  air  advection,  which  would  be 
so  useful  in  determining  storm  initiation.  Disregard  of  such  parameters  can  lead  to 
overforecasting  in  regions  where  the  tropopause  is  frequently  unstable  but  dynamics  are  weak-the 
result  being  thunderstorms  do  not  occur  nearly  so  often  as  one  might  infer  (McGinley,  Runk,  and 
Alleca,  1987)5.  An  overlap  of  the  stability  and  dynamics  is  necessary  to  get  the  best  estimates 
of  storm  potential.  For  this  study,  however,  we  used  a  perfect-prog  approach  in  lieu  of  forecast 
data  in  a  MOS  approach  due  to  constraints  mentioned  earlier. 


Table  1.  List  of  121  Predictors  Reviewed  in  Developing  a  Thunderstorm  Prediction  Technique. 


T 

surface  temp  (deg.  C) 

FF300 

300  mb  wind  speed  (knots) 

TD 

surface  dewpoint  (deg.  C) 

DD300 

300  mb  wind  direction  (deg.) 

TW 

surface  wetbulb  temp  (deg.  C) 

T200 

200  mb  temp  (deg.  C) 

TE 

surface  theta-E  (deg.  K) 

HT200 

200  mb  height  (meters,  AGL) 

HT 

surface  height  (meters,  AGL) 

FF200 

200  mb  wind  speed  (knots) 

FF 

surface  wind  speed  (knots) 

DD200 

200  mb  wind  direction  (deg.) 

DD 

surface  wind  direction  (deg.) 

BP 

positive  buoyant  energy  (CAPE)  (j/kg) 

T850 

850  mb  temp  (deg.  C) 

BM 

negative  buoyant  energy  (j/kg) 

TD850 

850  mb  dewpoint  (deg.  C) 

H20IN 

precipitable  water  (in) 

TW850 

850  mb  wetbulb  temp  (deg.  C) 

RI 

bulk  richardson  number 

TE850 

850  mb  theta-E  (deg.  K) 

MLI500 

negative  lifted  index  to  500  mb 

HT850 

850  mb  height  (meters,  AGL) 

MLI300 

negative  lifted  index  to  300  mb 

FF850 

850  mb  wind  speed  (knots) 

MSHOW 

negative  showalter  index 

DD850 

850  mb  wind  direction  (deg.) 

VERTT 

vertical  totals 

T700 

700  mb  temp  (deg.  C) 

CROSS 

cross  totals 

TD700 

700  mb  dewpoint  (deg.  C) 

SWEAT 

SWEAT  index 

TW700 

700  mb  wetbulb  temp  (deg.  C) 

KI 

K  index 

TE700 

700  mb  theta-E  (deg.  K) 

IN13 

(unknown) 

HT700 

700  mb  height  (meters,  AGL) 

CAP 

cap  strength  (deg.  C) 

FF700 

700  mb  wind  speed  (knots) 

LAPSE 

lapse  rate  (700-500  mb)  (deg.  C/km) 

DD700 

700  mb  wind  direction  (deg.) 

MAXT 

maximum  temp  forecast  (deg.  F) 

T500 

500  mb  temp  (deg.  C) 

DD06 

0-6  km  AGL  mean  wind  direction  (deg.) 

TD500 

500  mb  dewpoint  (deg.  C) 

FF06 

0-6  km  AGL  mean  wind  speed  (knots) 

TW500 

500  mb  wetbulb  temp  (deg.  C) 

DDSTRM 

storm  motion  (direction,  deg.) 

TE500 

500  mb  theta-E  (deg.  K) 

FFSTRM 

storm  motion  (speed,  knots) 

HT500 

500  mb  height  (meters,  AGL) 

HEL02 

0-2  km  AGL  total  helicity  (m/sec)2 

FF500 

500  mb  wind  speed  (knots) 

HEL03 

0-3  km  AGL  total  helicity  (m/sec)2 

DD500 

500  mb  wind  direction  (deg.) 

SHR02 

0-2  km  AGL  total  shear  (10'3  sec'1) 

T300 

300  mb  temp  (deg.  C) 

SHR03 

0-3  km  AGL  total  shear  (10'3  sec'1) 

TD300 

300  mb  dewpoint  (deg.  C) 

SHRBRN 

BRN  shear  (m/sec)2 

TW300 

300  mb  wetbulb  temp  (deg.  C) 

EH3 

energy/helicity  index 

TE300 

300  mb  theta-E  (deg.  K) 

DDSR 

0-3  km  AGL,  SR  directional  change  (deg.) 

HT300 

300  mb  height  (meters,  AGL) 

TEI 

theta-E  index  (deg.  C) 
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Table  1. 

continued 

TEMAXT 

max  theta-E  (deg.  K) 

FF500 

mean  inflow  speed  (knots) 

TEMAXP 

max  theta-E  pressure  (mb) 

DD500 

mean  inflow  direction  (deg.) 

TEIMINT 

min  theta-E  (deg.  K) 

VH1000 

horiz.  vort.,  sfc  to  1000  m  AGL  (10'3  sec'1) 

TEDvUNP 

min  theta-E  pressure  (mb) 

VS1000 

strmwse  vort.,  sfc  to  1000  m  AGL  (10"3  sec'1) 

IN35 

(unknown) 

FF1000 

mean  inflow  speed,  sfc  to  1000  m  AGL  (knots) 

PSFC 

surface  pressure  (mb) 

DD1000 

mean  inflow  direction,  sfc  to  1000  m  AGL  (deg.) 

PMAX 

Wmax  pressure  (mb) 

VH1500 

horiz.  vort.,  sfc  to  1500  m  AGL  (10'3  sec'1) 

LCLP 

LCL  pressure  (mb) 

VS 1500 

strmwse  vort.,  sfc  to  1500  m  AGL  (10'3  sec"1) 

WBZP 

wetbulb  zero  pressure  (mb) 

FF1500 

mean  inflow  speed,  sfc  to  1500  m  AGL  (knots) 

WBZT 

wetbulb  zero  height  (ft,  AGL) 

DD1500 

mean  inflow  direction,  sfc  to  1500  m  AGL  (deg.) 

MELP 

negative  equilibrium  Ivl  pres.(mb) 

VH2000 

horiz.  vort.,  sfc  to  2000  m  AGL  (10'3  sec'1) 

MPLP 

maximum  parcel  level  pressure  (mb) 

VS2000 

strmwse  vort.,  sfc  to  2000  m  AGL  (10'3  sec'1) 

ELHT 

equilibrium  level  height  (ft.  AGL) 

FF2000 

mean  inflow  speed,  sfc  to  2000  m  AGL  (knots) 

MPLHT 

max.  parcel  level  height  (ft,  AGL) 

DD2000 

mean  inflow  direction,  sfc  to  2000  m  AGL  (deg.) 

RHEL2 

rel.  helicity,  sfc  to  2  km  AGL  (m/sec)2 

VH2500 

horiz.  vort.,  sfc  to  2500  m  AGL  (10'3  sec'1) 

SHRPM2 

positive  shear,  sfc  to  2  km  AGL(10'3  sec*1) 

VS2500 

strmwse  vort.,  sfc  to  2500  m  AGL  (10'3  sec’1) 

SHR2 

neg.  shear,  sfc  to  2  km  AGL  (10'3  sec'1) 

FF2500 

mean  inflow  speed,  sfc  to  2000  m  AGL  (knots) 

HELPM2 

pos.  helicity,  sfc  to  2  km  AGL  (m/sec)2 

DD2500 

mean  inflow  direction,  sfc  to  2500  m  AGL  (deg.) 

HELM2 

neg.  helicity,  sfc  to  2km  AGL  (m/sec)2 

VH3000 

horiz.  vort.,  sfc  to  3000  m  AGL  (10‘3  sec'1) 

RHEL4 

rel.  helicity,  sfc  to  4  km  AGL  (m/sec)2 

VS3000 

strmwse  vort.,  sfc  to  3000  m  AGL  (10‘3  sec’1) 

SHRPM4 

pos.  shear,  sfc  to  4  km  AGL  (10’3  sec'1) 

FF3000 

mean  inflow  speed,  sfc  to  3000  m  AGL  (knots) 

SHRM4 

neg.  shear,  sfc  to  4  km  AGL  (10‘3  sec'1) 

DD3000 

mean  inflow  direction,  sfc  to  2500  m  AGL  (deg.) 

HELPM4 

pos.  helicity,  sfc  to  4  km  AGL  (m/sec)2 

HELM4 

neg.  helicity,  sfc  to  4  km  AGL  (m/sec)2 

RHEL6 

rel.  helicity,  sfc  to  6  km  AGL  (m/sec)2 

SHRPM6 

pos.  shear,  sfc  to  6  km  AGL  (10"3  sec'1) 

SHRM6 

neg.  shear,  sfc  to  6  km  AGL  (lO^sec'1) 

HELPM6 

pos.  helicity,  sfc  to  6  km  AGL  (m/sec)2 

HEL6 

neg.  helicity,  sfc  to  6  km  AGL  (m/sec)2 

VH500 

horiz.  vort.,  sfc  to  500  m  AGL  (10'3  sec'1) 

VS500 

strmwise  vort,  sfc  to  500m  AGL  (1  O'3  sec'1) 
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A  subset  of  the  parameters  in  Table  1  was  constructed  after  determining  the  most  promising 
predictors,  and  then  stepwise  regression  was  performed  on  the  predictor/predictand  data  set  to 
obtain  predictive  equations.  Before  discussing  how  the  predictand  was  determined  and  produced, 
first  we  discuss  how  our  pool  of  predictors  was  generated. 

2.1  SHARP  Program 

The  SHARP  program  was  chosen  to  generate  predictors  for  this  study  because  of  its  ease  of 
use  and  plentiful  choice  of  output  parameters.  As  long  as  upper  air  sounding  data  is  formatted 
correctly,  output  can  be  generated  relatively  easily  and,  if  desired,  copied  to  a  database  file  for 
statistical  analysis.  Soundings  can  also  be  displayed  and  edited  if  desired  and  a  generous  sample 
of  derived  output  parameters  can  be  displayed  for  quick  analysis.  It  is  considered  an  ideal  tool  for 
researchers. 

The  challenge  was  formatting  a  large  volume  of  soundings  into  the  desired  configuration  (see 
Figure  1).  SHARP  requires  height  (m),  temperature  (deg.  C),  dewpoint  temperature  (deg.  C), 
wetbulb  temperature  (deg.  C),  wetbulb  potential  temperature  (deg.  C),  wind  direction  (deg.)  and 
wind  speed  (knots),  at  the  surface  and  interpolated  every  50  mb  up  to  100  mb.  SHARP  also 
requires  pressure  (mb),  wind  direction  (deg.),  and  wind  speed  (knots)  at  the  surface  and 
interpolated  every  500  m  up  to  9000  m;  and  finally  storm  direction  (deg.)  and  speed  (knots).  As 
our  RAOB  data  were  not  collected  from  the  AFOS  system  which  is  the  target  operational  system 
for  which  SHARP  was  designed  and  implemented,  but  rather  from  the  database  archive  at  AF 
Combat  Climatology  Center  (AFCCC),  it  was  necessary  to  reformat  the  data  from  DATSAV2  to 
SHARP  text  file  format.  The  files  were  then  moved  or  copied  into  the  C:\sharp\raobs  directory 
(for  Pcs)  for  processing.  SHARP  processing  methodology  is  fully  described  in  the  SHARP 
Workstation  vl.50  Users  Manual.  After  specifying  the  number  and  types  of  database  elements 
and  then  databasing  (a)  sounding(s),  the  output  can  be  viewed  directly  in  C:\sharp\dbasel.txt  or 
input  to  an  Excel  spreadsheet  for  editing  and  further  processing. 
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Figure  1.  Sample  SHARP-Formatted  Upper-Air  RAOB  Observation  for  LBF,  North  Platte, 
NE  on  4  Oct  89  at  00Z. 
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2.2  Upper  Air  Soundings 


Efficient  reformatting  of  DATSAV2  upper  air  sounding  data,  for  SHARP  use,  required 
writing  Visual  Basic  macro  code  to  reformat  all  upper  air  data  available  during  the  months  of 
May,  July,  and  September  1992  and  1993  for  the  71  sites  chosen  for  this  study.  These  sites  were 
selected  due  to  data  availability.  The  71  station  sites  were  then  separated  into  12  regions  (see 
Figure  2),  in  agreement  with  categorization  of  the  CONUS  by  a  research  team  doing  similar  work 
to  predict  other  present  weather  and  surface  visibility.  A  list  of  the  stations  in  each  region  is 
shown  in  Table  2. 

Decisions  were  made  to  use  as  many  soundings  as  possible  while  maintaining  data  integrity 
(2,130  soundings  out  of  25,560,  or  8.3%,  were  discarded).  There  had  to  be  at  least  15  groups 
(representing  levels)  of  data;  otherwise,  the  sounding  was  not  used.  If  there  was  no  data  above 
200  mb,  the  sounding  was  not  used.  Likewise,  there  had  to  be  some  data  present  in  the  lowest 
150  mb.  The  amount  of  missing  data  required  us  to  make  other  assumptions: 

1)  Extrapolate  data  down  to  surface  level  to  fill  in  missing  surface  T,  TD.  The  standard  lapse 
rate  was  used  to  help  approximate  surface  temperature.  For  morning  soundings,  this  ignores 
potential  radiation  inversion,  which  results  in  a  more  unstable  profile  for  the  few  soundings 
affected.  The  hydrostatic  approximation  was  used  to  replace  missing  surface  pressure; 

2)  Pull  down  wind  direction  and  speed  from  one  of  the  next  six  highest  data  groups  to  fill 
missing  surface  wind  direction  and  speed; 

3)  Pull  up  dewpoint  from  a  lower  level  to  fill  in  300  mb  TD; 

4)  Pull  up  wind  direction  and  speed  from  one  of  the  next  six  highest  data  groups  to  fill  in  100 
mb  wind  direction  and  speed.  Only  temperature  and  dewpoint  were  extrapolated,  by  using 
standard  lapse  rate. 
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Figure  2,  Twelve  CONUS  Climatological  Regions. 


Table  2.  List  of  71  RAOB  Station  Sites  in  Each  of  12  CONUS  Regions 


REGION  1  (WEST  COAST) 

119.11W 

2  (M) 

SA.RB 

NlTD 

723910 

PT  MUGU  (NAWS) 

CA 

34.1  IN 

OAK 

724930 

OAKLAND 

CA 

37.73N 

12221W 

3  (M) 

SA.R6 

SLE 

726940 

SALEM/MCNARY 

OR 

44.91  N 

123.00W 

61  (M) 

SARB.SY 

UIL 

727970 

QUILLAYUTE  STATE 

WA 

47.95N 

124.55W 

62  (M) 

SARB.SY 

VBG 

723930 

VAN0ENB6RG  AF8 

CA 

34.73N 

120.58W 

112  (M) 

SA.RB.RR 

REGION  2  (INTER MOUNTAIN  WEST) 

874  (M) 

SA.RB.TJ.GU.MO.SY 

SOI 

726810 

BOISE  MUNICIPAL 

ID 

43.56N 

1 16.21  W 

DRA 

723870 

MERCURY/DESERT  ROCK 

NV 

36.  SIN 

1 16.01  W 

1009  (M) 

SARB 

EDW 

723810 

EDWARDS  AFB 

CA 

34.90N 

117.88W 

702  (M) 

SA.R8.GU 

ELY 

724860 

ELY/YELLAND  FLD 

NV 

39.28N 

114.85W 

1909  (M) 

SARB.SY 

GEG 

727850 

SPOKANE  INTI  ARPT 

WA 

47.63N 

117.53W 

721  (M) 

SAR8.TJ.GU.MO.SY 

GJT 

724760 

GRAND  JUNCTION 

CO 

39.1  IN 

108.53W 

1475  (M) 

SA.RB.MO.SY 

GTF 

727750 

GREAT  FALLS  INTL 

MT 

47.48N 

111.36W 

1115  (M) 

SA.RB.TJ.GU.SY 

LND 

725760 

LANOER/HUNT  FIELD 

WY 

420S1N 

108.73W 

1634  (M) 

SA.RB.TJ.SY 

MFR 

725970 

MEDFORD/JACKSON  CO. 

OR 

4236N 

12286W 

405  (M) 

SA.RR.RB.TJ.GU.MO.SY 

SLC 

725720 

SALT  LAKE  CITY  INTL 

UT 

40.78N 

111.96W 

1288  (M) 

SA.RB.TJ.GU.MO.SY 

WMC 

725830 

WINN6MUCCA  MUNI 

NV 

40.90N 

117.80W 

1315  (M) 

SA.RB.SY 

REGION  3  (SOUTHWEST) 

SA.RR.RB.UGU.MO.SY 

ABQ 

723650 

ALBUQUERQUE  !NTL 

NM 

35.05N 

106.61  W 

1620  (M) 

ELP 

722700 

EL  PASO  INTL  ARPT 

TX 

31. 80N 

106.40W 

1194  (M) 

SA.RB.TJ.GU.SY 

FHU 

722730 

FORT  HUACHUCA/LI8BY 

AZ 

31.60N 

110.35W 

1438  (M) 

SARB 

INS 

746140 

INDIAN  SPRINGS  GNRY 

NV 

36.58N 

115.66W 

952  (M) 

SA.RB 

TUS 

722740 

TUCSON  INTL  ARPT 

AZ 

3211N 

110.93W 

779  (M) 

SA.RR.RB.MO.SY 

REGION  4  (PANHANDLE) 

SA.RR.RB.MO.SY 

AMA 

723630 

AMARILLO  ARPT  (AWOS) 

TX 

35.23N 

101.70W 

1099  (M) 

DDC 

724510 

DODGE  CITY  (AWCS) 

KS 

37.76N 

99.96W 

790  (M) 

SARR.RB.TJ.GU.SY 

DRT 

722610 

DEL  RIO  INTL  (AUT) 

TX 

29.36N 

100.91W 

307  (M) 

SA.RB.GU.SY 

MAF 

722650 

MIDLAND  REGIONAL 

TX 

31.95N 

102.18W 

872  (M) 

SARR.R8.SY 

REGION  5  (MIDWEST  &  NORTHERN  TIER) 

SA.RR.RB.TJ.GU.SY 

BIS 

727640 

BISMARCK  MUNICIPAL 

NO 

46.76N 

100.75W 

506  (M) 

BNA 

723270 

NASHVILLE  METRO 

TN 

36.13N 

86.68W 

180  (M) 

SA.RR.RB.GU.MO.SY 

CAY 

724290 

DAYTON/JAMES  M.  COX 

OH 

39.90N 

84.20W 

306  (M) 

SAR8.GU.MO.SY 

DEN 

725650 

DENVER  INTL  ARPT 

CO 

39.86N 

104.66W 

1656  (M) 

SA.SY 

GGW 

727680 

GLASGOWINTLARPT 

MT 

48.21N 

106.61W 

700  (M) 

SARB.SY 

HON 

726540 

HURON  REGIONAL 

SD 

44  38N 

98.21W 

393  (M) 

SA.RR.RB.SY 

IBF 

725620 

N.  plattea.ee  bird 

NE 

41.13N 

100.68W 

849  (M) 

SARR.RB.TJ.GU.SY 

OMA 

725500 

OMAHA/EPPLEY  FLD 

NE 

41.30N 

95.  SOW 

299  (M) 

SA.RR.GU.MO.SY 

PAH 

724350 

PADUCAH/BARKLEY 

KY 

37.06N 

88.76W 

126  (M) 

SA.RR.RB 

PIA 

725320 

PEORIA  MUNICIPAL 

IL 

40.66N 

89.68W 

202  (M) 

SA.RB.TJ.SY 

RAP 

726620 

RAPID  CITY  REGIONAL 

SD 

44  05N 

103.06W 

966  (M) 

SARR.RB.TJ.GU.SY 

STC 

726550 

ST  CLOUD  MUNICIPAL 

MN 

45.55N 

94.06W 

312  (M) 

SARB 

TOP 

724560 

TOPEKA/BILLARD  MUNI 

KS 

39.06N 

95.61W 

270  (M) 

SARR.RB.TJ.GU.SY 

UMN 

723440 

MONET 

MO 

36.88N 

93.  SOW 

437  (M) 

RR.RB.TJ.SY 

REGION  6  (SOUTHWEST  INTERIOR) 

87.25W 

140  (M) 

SA.RR.RB 

CKL 

722290 

CENTR EVILLE/BIBB  CO. 

AL 

3290N 

GGG 

722447 

LONGVIEW/GREGG  CO. 

TX 

32.38N 

94.71W 

111  (M) 

SA.RR 

GSO 

723170 

GREENSBORO/PIEDMONT 

NC 

36.08N 

79.95W 

270  (M) 

SARB.TJ.MO.SY 

JAN 

722350 

JACKSON/THOMPSON 

MS 

3221 N 

90.08W 

100  (M) 

SARR.RB.TJ.GU.SY 

LZK 

723400 

NORTH  LITTLE  ROCK 

AR 

34.83N 

9225W 

165  (M) 

RR.RB.SY 

OUN 

723570 

NORMAN/WESTHBMER 

OK 

35.21  N 

97.45W 

357  (M) 

RB 

SEP 

722600 

STEPHENVILLBCLARK 

TX 

3221 N 

98.18W 

402  (M) 

SA,RR,RB 

REGION  7  (GULF  COAST) 

SARR.RB.TJ.GU.SY 

BRO 

722500 

BROWNSVILLE  INTL 

TX 

25.  SON 

97.43W 

6<M) 

CRP 

722510 

CORPUS  CHRIST!  INTL 

TX 

27.76N 

97.50W 

13  (M) 

SARR.RB.SY 

LCH 

722400 

LAKE  CHARLES  MUNI 

LA 

30.1  IN 

93.21W 

10  (M) 

SARR.RB.SY 

UX 

722330 

SUOELL  MUNICIPAL 

LA 

30.33N 

39.81  W 

8<M) 

RB 

TBW 

722100 

TAMPA  BAY  AREA 

FI 

27.70N 

8240W 

13  (M) 

RR.RB 

TLH 

722140 

TALLAHASSEE  RGNL 

FL 

30.38N 

84.36W 

21  (M) 

SARB.TJ.GU.SY 

REGION  8  (SOUTHEAST  COAST) 

SARR.RB 

AYS 

722130 

WAYCROSS/WARE  CO. 

GA 

31.25N 

8240W 

46  (M) 

CHS 

722080 

CHARLESTOWN  MUNI 

SC 

3290N 

30.03W 

15  (M) 

SARR.RB.SY 

HAT 

723040 

CAPE  HATTERAS 

NC 

35.26N 

75.55W 

3{M) 

SARR.RB.GU.SY 

PBI 

722030 

WEST  PALM  BEACH 

a 

26.68N 

80.1 1W 

6(M) 

SARR.RB.SY 

REGION  9  (GREAT  LAKES) 

0UF 

725280 

BUFFALO  INTL  ARPT 

NY 

4293N 

78.73W 

215  (M) 

SARR.RB.TJ.GU.MO.SY 

FNT 

726370 

FUNT/BISHOP  INTL 

Ml 

4296N 

83.75W 

233  (M) 

SARB.TJ.MO.SY 

GRB 

726450 

GREEN  BAY/STRAUBEL 

W1 

44.48N 

88.13W 

214  (M) 

SARB.TJ.SY 

INL 

727470 

INTERNATIONAL  FALLS 

MN 

48.56N 

93.38W 

361  (M) 

SARB.TJ.GU.SY 

SSM 

727340 

SAULTSTE  MARIE 

MI 

46.46N 

84.36W 

221  (M) 

SARB.TJ.GU.SY 

REGION  10  (APPALACHIANS) 

SARR.RB.SY 

AHN 

723110 

ATHENS  MUNICIPAL 

GA 

33.95N 

83.31  W 

247  (M) 

ALB 

725180 

ALBANY  COUNTY  ARPT 

NY 

42.75N 

73.80W 

89  (M) 

SARR.RB.TJ.GU.SY 

CAR 

727120 

CARIBOU  MUNICIPAL 

ME 

46.86N 

68.01W 

190  (M) 

SARB.TJ.GU.SY 

HTS 

724250 

HUNT1NGTON/TR!  STATE 

WV 

38.36N 

6255W 

255  (M) 

SARB.SY 

PIT 

725200 

PITTSBURGH  INTL 

PA 

40.  SON 

0O.21W 

373  (M) 

SARR.RB.TJ.GU.MO.SY 

REGION  11  (NORTHEAST  COAST) 

ACY 

702470 

ATLANTIC  CITY  INTL 

NJ 

39.45N 

74.56W 

20  (M) 

SARB.RR 

LAD 

724030 

WASH1NGT  ON/DULLES 

VA 

28.95N 

77.45W 

98  (M) 

SARB.MO.SY 

PWM 

726060 

PORTLAND  INTL 

ME 

43.65N 

70.31  W 

19  (M) 

SA.RR.RB.TJ.GU.SY 

WAL 

724020 

WALLOPS  ISLAND 

VA 

37.93N 

75.48W 

41  (M) 

SARB 

REGION  12  (SOUTHEAST) 

EYW 

722010 

KEY  WEST  INTL 

FL 

24.55N 

81.75W 

6  (M) 

SARR.RB.SY 
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5)  Interpolate  between  layers  to  fill  in  missing  T,  TD,  TW,  FF,  and  DD;  and 

6)  Highest  data  group  needs  to  be  higher  than  200  mb.  Since  SHARP  requires  data  up  to 
and  including  100  mb,  one  wouldn’t  want  to  extrapolate  across  a  5  to  6  km  vertical  extent  m  the 
top  of  the  sounding,  much  less  in  the  lower  part  of  the  sounding. 

Once  the  steps  listed  above  were  accomplished,  and  RAOB-level  data  filled  in,  the  second 
interpolation  was  carried  out  to  provide  SHARP-level  data  for  pressure  levels  shown  in  the  upper 
half  of  Figure  1.  Interpolation  required  use  of  standard  logarithms,  since  data  was  plotted  on 
Skew-T/log  P  diagrams. 

Producing  data  in  similar  format  as  the  bottom  part  of  Figure  1  required  interpolation  of  T, 
DD,  and  FF  to  the  desired  height  levels,  which  was  accomplished  in  much  the  same  way  as  the 

step  described  above. 

Hand-calculated  interpolations  and  manual  Skew-T/log  P  plots  of  sample  RAOB  raw 
sounding  data  agreed  with  similar  calculations  and  plots  of  analogous  SHARP-formatted  data 
sets,  thus  verifying  our  interpolation  algorithm.  A  comparison  of  a  sample  RAOB  and  SHARP- 
formatted  sounding  is  shown  in  Figure  3.  The  sample  RAOB  is  shown  by  solid  lines,  and  the 
SHARP  sounding  (dashed  lines)  adds  a  dewpoint  at  300  mb  and  temperature  and  dewpoint  at  the 
surface  for  this  example,  which  is  the  3  May  92  00Z  Rapid  City,  South  Dakota  profile.  As  one 
might  expect,  the  derived  soundings  are  biased  by  being  too  moist  at  300  mb  and/or  at  the 
surface,  where  our  algorithm  needed  to  fill  in  missing  data.  However,  from  having  looked  at  a 
large  sampling  of  the  data,  this  was  the  case  in  only  a  small  minority  of  the  soundings.  We 
conclude  that  the  hydrostatic  approximation  in  calculating  surface  temperature,  together  with 
dropping  down  TD  from  a  higher  level  would  result  in  added  surface  layer  instability,  thus 

potentially  leading  to  overforecasting  storm  occurrence. 

Only  00Z  and  12Z  soundings  were  used,  and  any  18Z  soundings  discarded,  producing  a 
maximum  of  25,560  soundings  (two  soundings  per  day,  up  to  30  days  per  month,  three  months 
per  year,  two  years,  and  71  RAOB  stations,  minus  the  unusable  RAOBs).  The  actual  number  of 
soundings  available  after  editing  and  used  for  statistical  analysis  was  23,430.  By  region,  the 
totals  are  shown  in  Table  3. 
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Figure  3.  Comparison  of  Sample  RAOB  and  SHARP-Formatted  Sounding . 


Table  3.  Number  of  RAOB  Soundings  Produced  for  Each  of  12  CONUS  Regions 


Region  1 
Region  2 
Region  3 
Region  4 
Region  5 
Region  6 
Region  7 
Region  8 
Region  9 
Region  10 
Region  11 
Region  12 

Total 


1992 

(May,  July,  Sep.) 

674 

1812 

721 

648 

2427 

1173 

1016 

683 

840 

841 
660 
165 


1993 

(May,  July,  Sep) 

681 

1783 

726 

698 

2409 

1200 

1032 

701 

857 

833 

679 

171 


11,660  U>770 


Total 

Percent 

1355 

5.8 

1395 

15.3 

1447 

6.2 

1346 

5.8 

4836 

20.6 

2373 

10.1 

2048 

8.7 

1384 

5.9 

1697 

7.2 

1674 

7.1 

1339 

5.7 

336 

1.4 

23,430 

99.8 

3.  PREDICTANDS 


Predictands  used  overall  and  for  each  region  are  illustrated  in  Figures  4-16. 

For  these  figures,  LTG  (lightning),  OBS  (surface  station  observations),  RAD  (manually 
digitized  radar),  STORM  (NCDC  storm  reports),  *OBS  (surface  station  obs  of  severe  storms), 
•RAD  (manually  digitized  radar  obs  of  severe  storms),  and  ‘STORM  (NCDC  severe  storm 
reports)  are  raw  storm  predictands,  and  HITS  (storm  w/i  33  s.m,  and  ±  1  hour),  sv  HITS  (same  as 
HITS  but  for  severe  storm),  HITM  (storm  w/i  66  s.m.  and  ±  3  hours),  sv  HITM  (same  as  HITM 
but  for  severe  storm),  and  HITL  (storm  w/i  100  s.m.  and  ±  6  hours),  sv  HITL  (same  as  HITL  but 
for  severe  storm)  are  derived  predictands.  Derived  predictands  are  dependent  on  any  storm 
occurrence  (using  LTG,  OBS,  RAD,  STORM)  and/or  severe  storm  occurrence  (ustng  *OBS, 
•RAD,  *  STORM).  The  -S,  -M,  and  -L  suffixes  represent  a  firrther  categorization,  described  in 

detail  later  in  this  section. 
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Figure  5.  Same  as  in  F: 
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Figure  6.  Same  as  in  Figure  4  but  for  Region  2  (Intermountain  West). 
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Figure  7.  Same  as  in  Figure  4  but  for  Region  3  (Southwest). 
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Figure  8.  Same  as  in  Figure  4  but  for  Region  4  (Panh 
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Figure  9.  Same  as  in  Figure  4  but  for  Region  5  (Midwest’s  Northern  Tier 
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Figure  10.  Same  as  in  Figure  4  but  for  Region  6  (Southwest  Interior). 
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Figure  11.  Same  as  in  Figure  4  but  for  Region  7  (Gulf  Coast). 
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Figure  12.  Same  as  in  Figure  4  but  for  Region  8  (Southeast  Coast). 
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in  Figure  4  but  for  Region  10  (Appalachians). 


3.1  Thunderstorm  Occurrence 

All  thunderstorm  and  severe  storm  occurrence  are  the  predictands,  and  the  types  of  data  used 
to  construct  the  set  of  predictands  are  shown  in  Table  4.  Severe  storms  were  characterized  as 
storms  exhibiting  funnel  clouds  or  tornadoes,  hail  0.75  in.  or  larger,  wind  50  knots  or  greater, 
and/or  heavy  rain. 

Table  4.  Types  of  Data  Used  to  Generate  Predictands. 

Surface  Observations 
Manually-digitized  radar  (MDR) 

Lightning 

STORM  SUMMARY 
*Only  available  for  1993. 

3.1.1  Manually  Digitized  Radar  (MDR)  Data 

Much  less  MDR  data  was  available  for  this  study  than  expected.  MDR  data  was  available  for 
approximately  20  percent  of  the  days  in  the  period  of  interest.  However,  the  agreement  with 
lightning  data  was  excellent,  and  so  we  decided  to  use  as  much  of  the  data  as  was  available  to  us. 
Severe  storms  were  defined  as  those  with  MDR  pixels  exhibiting  rainfall  rate  echo  intensities  of 
strong,  very  strong,  intense,  or  extreme  (digital  video  integrator  and  processor  (DVIP)  level  of  3 
or  greater).  Storm  activity  in  general  was  defined  as  DVIP  levels  1-6  inclusive  (we  made  a 
judgment  call  that  DVIP  level  1,  weak  intensity,  indicated  thunderstorm  occurrence). 

3.1.2  Surface  Observations 


Severe  +  Nonsevere 

Severe 

yes 

yes 

yes* 

yes* 

yes 

no 

yes 

yes 

Station  observations  were  obtained  in  DATSAV2  surface  data  fixed  length  format  and 
analyzed  for  any  type  of  storm  activity.  Severe  storms  were  those  with  weather  type  encoded  as 
19  (funnel/tomado),  27  (hail),  97  (heavy  tstm  w/o  hail),  99  (heavy  tstm  w/hail),  and/or  wind  speed 
of  263  tenths  of  meters  per  second  or  greater  (indicating  wind  speed  of  50  knots  or  greater). 
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3.1.3  Lightning  Flash  Data 


We  relied  most  heavily  on  Global  Atmospheric  lightning  flash  data,  taken  over  the  National 
Lightning  Detection  Network  and  obtained  through  AFCCC,  OL-A,  for  detection  of  storm 
activity.  It  was  formatted  on  OL-A  data  field  structure,  and  contained  an  overwhelming  amount 
of  detailed  flash  data,  which  had  to  be  edited  for  our  use.  An  efficient  software  parsing  and 
editing  routine  was  employed  to  reduce  this  voluminous  data  set.  The  area  encompassed  by  the 
lightning  predictand  was  in  the  shape  of  a  square  not  a  circle,  and  this  represents  what  we 
estimate  as  the  greatest  source  of  error  in  this  study. 

3.1.4  National  Climatic  Data  Center  (NCDC)  Storm  Data 

NCDC  storm  data  was  reviewed  manually  by  county  and  state  and  represents  the  most  reliable 
source  of  data  for  severe  storm  detection.  Since  we  were  initially  relying  on  both  MDR  data  and 
Storm  Data  for  severe  storm  identification,  our  discovery  of  the  MDR  data  shortage  left  us  with 
about  half  of  the  severe  storm  information  we  expected  to  have. 

We  pooled  predictands  into  3  types,  to  help  set  the  stage  for  eventual  evaluation  of 
predictions  using  model  forecast  data.  Ostensibly,  the  area  of  storm  detection  should  equal  model 
grid  size;  however,  we  also  produced  two  other  predictand  sets  for  comparison  purposes. 

The  predictand  types  are  as  shown  in  Table  5,  and  represent  both  a  circular  areal  extent  around 
RAOB  sites  and  a  temporal  period  about  RAOB  synoptic  times  (00Z  and  12Z).  The  spatial 
scheme  of  100  statute  miles  (s.m.)  is  similar  in  area  to  that  used  by  Charba  (1979)6.  A  radius  of 
100  s.m.  is  roughly  equivalent  to  one-half  the  average  spacing  of  upper  air  observation  sites 
across  the  CONUS  (one-  half  of  400  km)  or  144  s.m.  (Maddox,  1980)7.  Figure  17  shows  “L” 
areal  coverage  around  each  RAOB  site.  Figure  18  shows  the  temporal  coverage  about  RAOB 
launch  time.  Last,  Table  6  shows  the  number  of  storm-related  RAOBs,  both  severe  and  overall 
(including  severe). 
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Table  5.  Fredictand  Types 


Predictand  “Type” 

S(mall) 

M(edium) 

L(arge) 

Detection  w/i  circle  of  radius  &  (statute  miles): 

33 

66 

100 

Detection  w/i  ±  h  (hours): 

1 

3 

6 

Table  6.  Number  (Percent)  of  Storm-Related  RAOBs 


Region 

1992-1993 

% 

%of 

1992-1993 

% 

%of 

1992-1993 

% 

Overall/Total 

Total 

Severe/Total 

Total 

Severe/Storms 

1 

93/1355 

(6.8) 

0.4 

8/1355 

(•6) 

0.03 

8/93 

(8.6) 

2 

1166/3595 

(32.4) 

4.9 

89/3595 

(2.5) 

0.38 

89/1166 

(7.6) 

3 

563/1447 

(38.9) 

2.4 

58/1447 

(4.0) 

0.24 

58/563 

(10.3) 

4 

650/1346 

(48.3) 

2.8 

214/1346 

(15.9) 

0.91 

214/650 

(32.9) 

5 

2301/4836 

(47.6) 

9.8 

739/4836 

(15.3) 

3.10 

739/2301 

(32.1) 

6 

1270/2373 

(53.5) 

5.4 

379/2373 

(15.9) 

1.50 

379/1270 

(29.8) 

7 

1058/2048 

(51.7) 

4.5 

356/2048 

(17.4) 

1.50 

356/1058 

(33.6) 

8 

712/1384 

(51.4) 

3.0 

271/1384 

(19.5) 

1.15 

271/712 

(38.1) 

9 

522/1697 

(30.7 

2.2 

119/1697 

(7.0) 

0.51 

119/522 

(22.7) 

10 

600/1674 

(35.8) 

2.6 

190/1674 

(11.4) 

0.81 

190/600 

(31.6) 

11 

476/1339 

(35.5) 

2.0 

175/1339 

(13.1) 

0.74 

175/476 

(36.7) 

12 

-  199/336 

(59.2) 

0.8 

77/336 

(22.9) 

0.33 

77/199 

(38.6) 

Total 

9610/23430 

(41.0) 

2675/23430 

(11.4) 

2675/9610 

(27.8) 

3.2  Maximum  Storm  Cloud  Top  Height 

Equilibrium  Level  (EL)  height  was  used  as  a  predictand  to  estimate  maximum  storm  cloud  top 
height.  EL  is  the  height  where  the  temperature  of  a  buoyantly  rising  parcel  again  becomes  equal 
to  the  temperature  of  the  environment.  This  predictand  was  developed  using  only  the  “L” 
predictand  data  pool,  including  both  non-severe  and  severe  storms. 

No  additional  data  processing  was  necessary  for  this  part  of  the  study,  because  EL  height  had 
been  one  of  the  predictors  for  storm  occurrence.  Using  EL  height  as  a  predictand  meant  using  a 
non-binary  predictand  and  ignoring  those  soundings  not  characterized  by  an  EL  (or  where  EL 
height  is  zero). 


28 


29 


Figure  17.  Spatial  Coverage  Around  Each  EAOB  Station  Site  for  the  “L”  Predictors. 
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Figure  18.  Temporal  Coverage  Used  for  Each  RAOB  Station  Site. 


4.  CORRELATION  COEFFICIENTS 

Shown  in  Figures  19-57  are  correlation  coefficients  for  the  top  36  predictors  (all  storm  and 
severe  storm  cases)  and  each  of  the  three  predictand  types,  for  each  of  the  12  CONUS  regions 
and  the  CONUS  overall.  The  equations  presented  in  Section  6  were  developed  using  the  top  ten 
predictors  for  the  respective  regions  and  predictand  types  and  were  identified  using  these  charts. 
Although  the  predictors  are  listed  in  the  same  order  on  each  chart,  this  is  the  SHARP-specified 
order  in  which  predictors  were  databased.  Note  that  one  of  the  predictors,  IN35,  is  not  defined  in 
this  report-the  authors  of  SHARP  were  not  able  to  reconstruct  its  identity.  It  does  appear  to  be 
useful  in  some  cases,  and  if  identified,  we  will  make  an  addendum  to  this  report. 

This  is  a  good  place  to  note  that  when  referring  to  these  figures,  one  should  be  aware  that  not 
all  totals  are  additive.  For  example,  *STORM  and  *OBS-L  are  not  mutually  exclusive. 

A  striking  characteristic  are  those  charts  showing  predictors  more  highly  correlated  to  severe 
storms  than  to  severe  and  non-severe  storms.  This  occurs  markedly  for  Region  1-100  (Region  1, 
100  s.m.  predictand  type)  and  Region  2-66,  and  more  dramatically  for  Regions  4-33,  5-100,  6- 
100,  7-66,  8-66,  9-100,  10-100,  11-33,  and  12-66.  This  may  be  partly  explained  by  the  relatively 
low  number  of  RAOBs  for  several  regions;  the  five  regions  with  the  lowest  RAOB  data  pools  (see 
Table  3)  were  Regions  1  (5.8%),  4  (5.8%),  8  (5.9%),  11  (5.7%),  and  12  (1.4%).  Figures  58 
through  63  also  shed  some  light.  For  Region  1,  there  were  only  8  severe  storms  and  93  storms  of 
any  kind.  The  low  number  of  Region  1  predictands  is  evident  in  Figures  58-63.  Region  2  had 
1166  out  of  3595  storm-related  RAOBs  (32.4%),  which  was  not  particularly  low  but  only  89  out 
of  3595  RAOBs  that  were  severe  storm-related  ( 2.5%).  Comparing  Figures  58  and  59  illustrates 
this. 
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Figure  19.  Correlation  Coefficients  for  Overall  Case  Within  100  Statute  Mile  Radius. 
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Figure  21.  Same  as  Figure  19  but  for  33  Statute  Mile  Radius. 
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Figure  22.  Correlation  Coefficients  for  Region  1  (West  Coast)  Within  100  Statute  Mile  Radius. 
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Figure  23.  Same  as  Figure  22  but  for  66  Statute  Mile  Radius. 
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Figure  24.  Same  as  Figure  22  but  for  33  Statute  Mile  Radius. 
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Figure  25.  Correlation  Coefficients  for  Region  2  (Intermountain  West)  Within  100  Statute  Mile  Radius. 
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Figure  26.  Same  as  Figure  25  but  for  66  Statute  Mile  Radius. 


0.200 


liidpyjdoo  uoiieiauoo 


2 

o 

o 

£ 

Q. 


40 


Figure  27.  Same  as  Figure  25  but  for  33  Statute  Mile  Radius. 
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Figure  28.  Correlation  Coefficients  for  Region  3  (Southwest)  Within  100  Statute  Mile  Radius. 
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Figure  29.  Same  as  Figure  28  but  for  66  Statute  Mile  Radius. 
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Figure  30.  Same  as  Figure  28  but  for  33  Statute  Mile  Radius 
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Figure  31.  Correlation  Coefficients  for  Region  4  (Panhandle)  Within  100  Statute  Mile  Radius. 
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Figure  32.  Same  as  Figure  31  but  for  66  Statute  Mile  Radius. 
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Figure  33.  Same  as  Figure  31  but  for  33  Statute  Mile  Radius. 
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Figure  35.  Same  as  Figure  34  but  for  66  Statute  Mile  Radius. 
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Figure  36.  Same  as  Figure  34  but  for  33  Statute  Mile  Radius. 
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Figure  37.  Correlation  Coefficients  for  Region  6  (Southwest  Interior)  Within  100  Statute  Mile  Radius. 
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Figure  38.  Same  as  Figure  37  but  for  66  Statute  Mile  Radius. 
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Figure  39.  Same  as  Figure  37  but  for  33  Statute  Mile  Radius. 
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Figure  40.  Correlation  Coefficients  for  Region  7  (Gulf  Coast)  Within  100  Statute 
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Figure  43.  Correlation  Coefficients  for  Region  8  (Southeast  Coast)  Within  100  Statute  Mile  Radius. 
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Figure  44.  Same  as  Figure  43  but  for  66  Statute  Mile  Radius. 
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Figure  45.  Same  as  Figure  43  but  for  33  Statute  Mile  Radius. 
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Figure  46.  Correlation  Coefficient,  for  Region  9  (Great  Laker)  Within  100  Statute  Mile  Radius. 
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Figure  48.  Same  as  Figure  46  but  for  33  Statute  Mile  Radius 
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Figure  49.  Correlation  Coefficients  for  Region  10  (Appalachians)  Within  100  Statute  Mile  Radius. 
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Figure  50.  Same  as  Figure  49  but  for  66  Statute 
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Figure  51.  Same  as  Figure  49  but  for  33  Statute  Mile  Radius. 
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Figure  52.  Correlation  Coefficients  for  Region  1 1  (Northeast  Coast)  Within  100  Statute  Mile  radius. 
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Figure  53.  Same  as  Figure  52  but  for  66  Statute  Mile  Radius. 
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Figure  54.  Same  as  Figure  52  but  for  33  Statute  Mile  Radius. 
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Figure  55.  Correlation  Coefficients  for  Region  12  (Southeast)  Within  100  Statute  Mile  Radius. 
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Figure  56.  Same  as  Figure  55  but  for  66  Statute  Mile  Radius. 
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Figure  57.  Same  as  Figure  55  but  for  33  Statute  Mile  Radius. 


Figure  33  is  deceptive  because  there  were  no  severe  storm  cases  for  the  1992  part  of  the  data 
in  this  predictand  set;  in  both  1992  and  1993,  all-storm  correlations  are  being  compared  with  only 
1993  severe  storm  data.  Thus,  it  may  not  be  unusual  to  see  storm  correlation  coefficients  higher 
than  for  the  overall  storm  case.  By  the  same  token,  there  is  the  same  lack  of  1992  predictands  for 
Figure  32;  however,  here  the  all-storm  coefficients  happen  to  be  higher  than  the  severe  storm 
coefficients.  Because  of  the  lack  of  severe  storm  data,  no  significant  conclusion  should  be  drawn. 

There  is  a  similar  lack  of  1992  severe  storm  predictands  (again,  refer  to  Figures  58-63)  for  the 
remaining  CONUS  regions  and,  by  referring  to  Figures  58  and  59,  it  is  seen  that  both  1992  all¬ 
storm  and  severe  predictands  are  lower  in  number  across-the-board,  for  all  regions. 

One  additional  explanation  for  the  anomalies  in  the  correlation  coefficient  charts  is  that  the 
CONUS  regions  were  not  defined  by  the  same  number  of  stations:  Region  12  has  one  and  Region 
4  has  four,  whereas  Region  5  has  14  stations. 

Table  7  shows  the  top  ten  predictors,  by  region  and  predictand  type,  that  were  identified  by 
analyzing  correlation  coefficients.  At  this  point,  it  is  reasonable  to  ask,  without  having  any  test 
data  available  (forecast  fields  from  a  mesoscale  model  at  the  required  forecast  intervals),  what 
model(s)  should  be  developed,  given  the  fact  that  no  testing  will  be  done,  and  no  comparisons  can 
be  made  among  the  three  different  model  types  (those  developed  using  the  three  predictand 
types).  Given  this  situation,  it  was  decided  that  a  very  limited  amount  of  tests  would  be 
accomplished,  to  ensure  that  a  generalized  model,  developed  from  1992  and  1993  data  for  the  all 
storm  predictand  type,  produced  reasonable  outcomes.  More  exhaustive  follow-on  work  can  be 
accomplished  relatively  easy,  given  that  the  predictand/predictor  databases  have  been  constructed, 
edited,  and  manipulated  for  ease  of  use. 

Because  of  the  way  in  which  these  data  spreadsheets  have  been  constructed,  models  can  be 
constructed  using  individual  stations.  Thus,  if  one  wants  to  investigate  an  overseas  location 
which  is  climatologically  similar  to  one  of  our  71  stations,  a  storm  forecast  equation  can  be 
tailored  for  that  overseas  station  or  area. 
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Figure  59.  Same  as  in  Figure  68  but  for  Severe  Storm 
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Figure  62.  Same  as  in  Figure  68  but  for  33  Statute  Mile  Radius. 
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Table  7.  Top  Ten  Predictors  by  Region  and  Predictand  Type 
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There  are  other  possibilities  on  a  larger  scale-for  example,  a  set  of  our  CONUS  stations 
could  be  constructed  as  a  “region”  of  its  own,  apart  from  our  12  regions,  to  “match”  any 
overseas  region  desired  which  exhibits  similar  frequency  of  occurrence  of  thunderstorm  activity. 

The  next  section  recaps  the  limited  testing  accomplished  to  ensure  the  linear  model  we 
constructed  produced  reasonable  probabilities  for  thunderstorm  occurrence. 

5.  PREDICTION  EQUATIONS 


Tables  8  and  9  show  the  results  of  linear  model  construction  for  the  100  s.m.  predictand  and 
step  regression,  using  S-PLUS  advanced  data  analysis  software.  Only  one  predictor,  PMAX, 
dropped  out  of  the  1993  model  as  a  single  term  deletion  from  the  step  regression.  These  models 
and  those  for  the  66  s.m.  and  33  s.m.  predictands  are  listed  below: 


1993 

100  s.m. 

66  s.m. 

33  s.m. 

%  Probability  of  Occurrence 

=  -0.00563*K 
+0.03872*MSHOW 
-0. 04495  *TW850 
-0.00053*TE 
+0.003 14*TE850 
+0.02538*TW 
+0.0 1673  *TD700 
+0.00021  *LCLP 
+0.00727*TD850 
-0.15254 

=  -1 1.30133 
+0.02185*TE700 
+0.01442*TE850 
-0.06043  *TW850 
+0.0143  l*MSHOW 
-0.03967*TW700 
+0. 00362 *K 
+0.02166*TW 

=  -9.33211 
+0.00625*TE850 
+0.01078*MSHOW 
-0.029 1 8*TW850 
+0. 02365  *TE700 
+0.00954*TW 
-0.04191*TW700 
+0.00290*MLI300 
-0.0041 1*T850 

1992 

100  s.m. 

66  s.m. 

33  s.m. 

%  Probability  of  Occurrence 

=  -0.00285*K 

+0.03662*MSHOW 

-0. 04425  *TW850 

-0.00776*TE 

+0.00839*TE850 

+0.01299*TW 

+0.01 102*TD700 

+0.0007 1*LCLP 

+0.00073*PMAX 

-4.06986 

=  -6.25947 

+0.02258*TE850 

-0.06384*TW850 

+0.00146*K 

+0.00043  *MWBZP 

+0.00487*TW 

=  -6.97971 

+0.02057*TE850 

+0.00627*MSHOW 

-0.00071  *K 

-0.06408*TW850 

+0.00292*TE700 

-0.00496*TW 

+0.00233*MLI300 
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Table  8.  1993  Linear  Storm  Forecast  Model  Regression  Results 


Coefficients 

Predictors  Standard  Value 

(Intercept) 

-0.1526 

K 

-0.0059 

MSHOW 

0.0391 

TW850 

-0.0459 

TE 

-0.0005 

TE850 

0.0032 

TW 

0.0252 

TD700 

0.0171 

LCLP 

0.0002 

PMAX 

0.0001 

TD850 

0.0078 

Table  9.  1992  Linear  Storm  Forecast  Model  Regression  Results 

Coefficients 

Predictors  Standard  Value 

(Intercept) 

-4.0699 

K 

-0.0029 

MSHOW 

0.0366 

TW850 

-0.0443 

TE 

0.0077 

TE850 

0.0084 

TW 

-0.0130 

TD700 

0.0110 

LCLP 

-0.0007 

PMAX 

0.0007 

TD850 

-0.0021 

Table  10  summarizes  a  10-sounding  test  of  the  1993  and  1992  models.  Data  was  taken  from 
both  1992  +  1993  and  so  contains  dependent  and  independent  data.  Figure  64  illustrates  the 
similarity  in  the  results,  although  the  1993  percentages  are  higher  in  almost  every  case,  as  might 
be  expected.  This  is  not  the  case  for  the  other  types  of  predictands.  From  this  small  test  sample, 
it  appears  that  the  100  s.m.  predictand  would  provide  a  reasonable  storm  nowcast.* 

A  linear  model  was  also  created  and  limited  tests  conducted  for  maximum  storm  cloud  top, 
using  equilibrium  level  (EL)  height  as  the  predictand.  The  model  is  described  below,  and  was 
built  using  1993  data  only  for  those  soundings  when  EL  height  was  computed  by  the  SHARP 
program.  Figure  65  shows  the  overall  best  predictors  for  EL  height.  MPLHT  was  dropped, 
because  it  produced  ambiguous  results  for  severe  storm  soundings  when  MPLHT  was 
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Table  10.  Summary  of  Limited  Test  for  Thunderstorms 
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Note: _ severe  non-storm  non-storm  severe  severe  non-storm  storm  non-storm  svr  storm;  non-storm 

|  wx  case  case _ case _ case _ case _ case _ case  case  LI  =  15,  case 

Overall  Model  (100  s.m.) _ j>4 _ 32 _ 20 _ 78 _ 67 _ 13^ _ 76  24  93  25 

Overall  Model  (svr  (100  s.m.))  |  18^ _ 8 _ 1 _ 36  24  0  40  1  48  2 


1.200 
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Figure  65.  Correlation  Coefficients  for  Maximum  Storm  Cloud  Top  Height  Predictors. 


between  0-10,000  feet  (approx,  limits);  and  IN13  was  dropped  due  to  our  inability  to  define  this 
parameter. 


0.8084105 

BP 

-493.7144 

LI300 

507.6571 

TW 

-39.84367 

TEI 

148,9849 

TEMAXT 

-25504.33 

The  results  in  Table  1 1  are  promising,  since  the  model  produced  30,000+  foot  cloud  tops  for 
the  severe  storm  cases.  Test  sounding  #5  and  #7  are  anomalous;  however,  they  do  not  necessarily 
invalidate  the  model,  since  (in  the  case  of  #7)  storm  potential  is  not  always  realized  and  (in  the 
case  of  #5)  thunderstorm  tops  of  20-25,000  feet  are  not  uncommon.  Linear  model  summary  and 
step  regression  details  are  shown  in  Table  12. 


Table  11.  Summary  of  Limited  Test  for  Maximum  Cloud  Top  Height 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

BP 

472 

424 

1907 

0 

236 

1629 

1119 

15 

6054 

33 

LI300 

1 

-2 

-7 

0 

0 

-7 

-3 

1 

-18 

0 

TW 

20.3 

15.6 

23.1 

5.8 

17.2 

17.3 

22.4 

14.4 

28.2 

14.6 

TEI 

8.87 

25.88 

18.6 

19.73 

14.95 

23.7 

17 

15.21 

50.76 

6.45 

TEMAXT 

338.8 

339.64 

350.36 

333.13 

327.2 

351.08 

352.86 

329.59 

376.14 

325.6 

SIN 

VBG 

VBG 

DDC 

DDC 

OMA 

OMA 

JAN 

LZK 

DDC 

RAP 

YR 

1992 

1992 

1992 

1992 

1992 

1992 

1992 

1992 

1992 

1992 

MO 

7 

9 

9 

5 

5 

7 

7 

5 

7 

9 

DAY 

13 

14 

7 

6 

15 

2 

26 

3 

23 

13 

0 

0 

12 

12 

00 

12 

12 

12 

00 

12 

Km 

24,506 

25,396 

30,951 

23,340 

22,838 

30,630 

28,775 

22,511 

42.293 

22,775 

Note 

non¬ 

storm 

case 

non¬ 

storm 

case 

svr- 

storm 

case 

non¬ 

storm 

wx  case 

storm 

case 

svr- 

storm 

case 

Non¬ 

storm 

case 

non¬ 

storm 

case 

svr 

storm; 

L  =  -15, 
SWEAT 
=  595 

non¬ 

storm 

case 
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Table  12.  1993  Linear  Storm  Cloud  Top  Model  Regression  Results 


Coefficients 

Predictors  Standard  Value 

(Intercept) 

-25504.3300 

BP 

0.8084 

LI300 

-493.7144 

TW 

507.6571 

TEI 

-39.8437 

TEMAXT 

148.9849 

STD  ERROR 

3955.5300 

As  stated  in  the  Introduction,  prediction  equations  would  be  used  to  satisfy  Air  Force 
Weather’s  Theatre  Battle  Management  stated  requirements  and  these  requirements  included  the 
development  of  present  weather  prediction  techniques,  an  effort  running  in  conjunction  with  the 
thunderstorm  prediction  effort  described  here.  Researchers  working  on  present  weather 
prediction  techniques  identified  a  need  for  a  generalized  thunderstorm  and  severe  thunderstorm 
equation  based  on  100  statute  mile  radius  but  for  ±2  hours  in  time.  To  satisfy  this  need, 
generalized  thunderstorm  equations  were  developed  using  the  1992  and  1993  data  bases  and 
scheme  described  previously  but  limiting  severe  thunderstorm  occurrence  to  ±2  hours  as  derived 
from  the  Storm  Summary  data  base  and  including  the  “Lightning  -s”  predictands  (see  Figure  4). 
The  resulting  General  Thunderstorm  Global  Algorithm  and  Global  Severe  Thunderstorm 
Equation,  which  were  incorporated  in  the  present  weather  prediction  effort,  is  shown  in  Tables  13 
and  14. 
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Table  13.  General  Global  Thunderstorm  Linear  Storm  Forecast  Model  Regression  Results 


COEFFICIENTS 

PREDICTORS  STANDARD  VALUE 

(INTERCEPT) 

+127.3597 

LATITUDE 

+0.01219 

LONGITUDE 

-0.03100 

T 

+0.10387 

TD 

+0.05580 

TW 

-0.18570 

TE 

+0.06289 

T850 

-0.05212 

TD850 

-0.02513 

TW850 

+0.67128 

TE850 

-0.10506 

T700 

-0.14569 

TD700 

-0.05913 

TW700 

+0.61291 

TE700 

-0.10503 

HT700 

+0.00135 

T500 

-0.48531 

TD500 

-0.02522 

TW500 

+0.23918 

TE500 

+0.00323 

HT500 

-0.00575 

T300 

-0.01009 

TD300 

+0.25248 

TW300 

+0.24882 

TE300 

-0.13560 

HT300 

-0.00025 

HT200 

+0.00035 

BPLUS 

-0.00010 

H20 

+0.68911 

VERTT 

+0.55158 

CROSST 

+0.45551 

SWEAT 

-0.00008 

IN13 

+0.00071 

CAP 

-0.01504 

MAXT 

-0.04546 

dd06 

+0.00080 

DDSTRM 

-0.00262 

TEI 

-0.00358 

TEIPB 

-0.00008 

TEMAXT 

+0.01839 

TEMAXP 

+0.00021 

TEIMINT 

-0.01869 

IN35 

-0.53530 

PMAX 

+0.00088 

LCLP 

-0.00081 

WBZP 

-0.00130 

WBZHT 

+0.00001 

ELP 

-0.00086 

MPLP 

+0.00016 

ELHT 

+0.000004 
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MPLHT 

SHR4 

HEL4 

SHR6 

HEL6 

F1000 

FF1500 

DD1500 

FF2000 

DD2000 

FF2500 

DD2500 

FF3000 

DD3000 


+0.000005 

-0.05899 

-0.00095 

+0.02224 

-0.00101 

+0.04607 

-0.07350 

-0.00046 

+0.10574 

+0.00008 

-0.10079 

+0.00024 

+0.00052 

+0.00062 


Table  14.  Severe  Global  Thunderstorm  Linear  Storm  Forecast  Model  Regression  Results 


COEFFICIENTS 

(INTERCEPT) 

TW 

TD850 

TW850 

TE850 

BPJDKG 

CROSST 

SWEAT 

FFOT6 

FFSTRM 

SHRBRN 

IN35 

SHR4 

SHR6 


PREDICTORS  STANDARD  VALUE 
+14.889738 
+0.112705 
-0.353522 
+0.730811 
-0.090695 
+0.000105 
+0.307847 
-0.000272 
+0.003989 
+0.038964 
+0.014673 
-0.055614 
+0.056171 
-0.000667 


♦DETERMINANT  THRESHOLD:  >  .08  =  SEVERE  THUNDERSTORM:  <  .08  =  NO  SEVERE 
THUNDERSTORM.  POD  =  0.627.  FAR  =  0.230. 
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6.  PRELIMINARY  TEST  USING  RAOB  DATA 


Figure  66  test  results  appear  promising.  To  draw  a  more  confident  conclusion  from  these 
limited  results  more  information  about  the  test  soundings  and  the  associated  local  and  synoptic 
weather  patterns  would  be  helpful.  Below  is  some  background  information: 

Storm  Cases 

Svr  storm,  12Z  7/13/93,  EYW:  At  least  two  funnel  clouds  were  sighted,  and  1.5”  diameter 
hail  was  reported  over  South  Florida  locations  this  day.  KI  =  20;  Showalter  Index  =  5;  Lifted 
Index  =  -8. 

Svr  storm,  OOZ  7/5/93,  PIT:  Severe  thunderstorm  winds  were  reported.  KI  =  42;  Showalter 
Index  =  -5;  Lifted  Index  =  -6. 

Svr  storm,  OOZ  5/5/92,  BRO:  3/4”  diameter  hail  and  wind  gusts  to  60  mph  were  reported.  At 
1256CST,  a  waterspout  was  reported  in  Galveston  City.  KI  =  36;  Showalter  Index  =  -1;  Lifted 
Index  =  -7. 

Storm,  OOZ  7/9/92,  EDW:  Tropical  moisture  associated  with  Tropical  Storm  Darby  moved 
into  southern  California  and  brought  unseasonal  rains  to  the  area,  ranging  from  a  trace  to  3/4”  in 
the  mountains.  KI  =  42;  Showalter  Index  =  -3;  Lifted  Index  =  -9. 

Severe  storm,  OOZ  7/23/92,  DDC:  Severe  storms  dropped  large  hail  (up  to  1.75”  diameter  in 
surrounding  areas)  and  heavy  flooding  rainfall.  KI  =  45;  Showalter  Index  =  -9;  Lifted  Index  =  - 
18. 


No  storm,  12Z  5/12/93,  EYW:  No  storm  activity  reported  from  8-18  May  93.  KI  =  9; 
Showalter  Index  =  9;  Lifted  Index  =  1 . 

No  storm,  OOZ  5/23/93,  SLE:  No  storm  activity  from  20-24  May  93.  KI  =  3;  Showalter 
Index  =12;  Lifted  Index  =  1. 

No  storm,  OOZ  7/1/92,  EDW:  No  storm  activity  in  the  area  for  entire  month  of  June  and  first 
half  of  July  92.  KI  =  5;  Showalter  Index  =  10;  Lifted  Index  =  2. 

No  storm,  OOZ  5/16/92,  OAK  No  storm  activity  in  the  area  for  entire  month.  KI  =  9; 
Showalter  Index  =  8;  Lifted  Index  =  0. 

No  storm,  12Z  9/10/92,  DDC:  No  storms  in  the  area  for  ~  10  days.  KI  =  1;  Showalter  Index 
=  13;  Lifted  Index  =  2. 

In  retrospect,  more  testing  would  help  substantiate  these  limited  but  promising  results. 
Additionally,  forecast  predictors  would  help  to  tailor  this  study’s  models  for  forecast  periods  of 
interest. 


89 


Now  that  we  have  an  idea  of  the  general  character  of  1992  and  1993  thunderstorms  over  our 
CONUS  regional  network  with  respect  to  our  chosen  set  of  predictors,  and  have  accomplished 
some  limited  tests,  we  did  the  following:  1)  developed  generalized  (1992-1992)  models  to 
diagnose  (“nowcast”)  storm  and  severe  storm  occurrence  (see  Table  13);  2)  conducted  limited 
testing  of  both  models  (see  Figure  66);  3)  developed  similar  “generalized”  models  for  each  of  the 
12  CONUS  regions  (see  Table  14). 


Table  15.  Generalized  (1992-1993)  Overall  and  Severe  Storm  Models  for  all  Regions  at 

100  Statute  Mile  Radius 


Coefficients 

(Intercept) 

K 

MLI300 

MSHOW 

PMAX 

TD 

TD700 

TD850 

TE 

TE700 
TE850 
TW 
TW700 
TW850 
STD  Error 


Region  All/Type  All 
-5.0264 
-0.0047 

0.0235 

0.0001 

0.0120 

0.0054 

-0.0009 

0.0194 

0.0193 

-0.0750 

0.4472 


Region  All/Type  Severe 
-10.4451 

0.0020 

0.0032 

0.0053 

0.0014 

-0.0016 

0.0128 

0.0226 

0.0078 

-0.0252 

-0.0643 

0.2959 
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66.  Limited  Test  Results  for  Generalized  Case  (1992  and  1993). 


Table  16.  Generalized  (1992  and  1993)  Overall  and  Severe  Storm  Model  Statistics  for  12  CONUS  Regions 

at  100  Statute  Mile  Radius. 
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Table  16.  (continued) 
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Table  16.  (continued) 
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7.  ALGORITHM  TEST  USING  MM5  FORECAST 


A  limited  study  using  MM5  forecasts  was  performed  on  the  thunderstorm  algorithm  models 
described  earlier  in  this  report.  The  study  consisted  of  four  main  phases;  generating  forecasts 
with  the  Penn  State/NCAR  Mesoscale  Model  (MM5),  using  the  output  from  MM5  for  input  to 
Skew-T,  Hodograph  Analysis  and  Research  Program  (SHARP)  to  produce  the  thunderstorm 
predictor  values;  using  the  spreadsheet  generated  by  SHARP  to  compute  thunderstorm 
probabilities  with  the  regional  and  general  thunderstorm  models;  and  evaluate  the  forecast 
probabilities  generated  by  the  models  with  observed  data. 

7.1  MM5  Forecast 

The  MM5  system  was  used  in  its  basic  form.  The  only  deviation  from  a  normal  MM5  run  was 
to  use  MRF  forecast  files  for  the  initial  and  boundary  condition.  The  model  produced  the  analysis, 
12,  24,  36,  and  48  hour  forecasts  for  both  the  0  and  12  Z  initial  times  for  the  31  days  of  August 
1993.  The  whole  suite  of  MM5  pre-  and  post-  processors  were  used  for  this  study:  TERRAIN, 
DATAGRID,  RAWINS,  and  INTERP  (front-end  and  back-end).  The  inner  domain  was  defined 
with  a  46  by  61  grid  with  a  resolution  of  80  kilometers  over  the  continental  U.S.  (CONUS).  The 
hydrostatic  version  of  the  model  was  used  in  this  study.  The  standard  output  files  from  the  post¬ 
processor  INTERP,  executed  in  back-end  mode,  were  used  in  the  processing  steps.  This  process 
generated  62  (2  *  31  days)  output  files.  Most  output  files  contained  five  forecast  durations. 
There  were  a  total  of  304  forecasts  out  of  a  possible  310  forecasts  (62  *  5  forecasts)  when  all  the 
production  runs  were  complete.  There  were  six  missing  forecast  durations  from  three  model  runs 
due  to  unknown  model  errors  and  there  was  not  enough  time  to  find  and  rectify  them. 
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7.2  SHARP 


Before  running  SHARP,  the  MM5  output  had  to  be  interpolated  from  its  gridded  format  to 
the  69  RAOB  site  locations  (Table  2)  and  converted  into  files  that  SHARP  can  read  (Figure  1).  A 
FORTRAN  program  was  developed  to  interpolate  to  the  RAOB  sites  and  create  files  that  mimic  a 
RAOB  record  file.  This  process  created  20976  files  (304  forecasts  *  69  RAOB  stations). 

A  Microsoft  Excel  macro  was  then  used  to  create  the  input  files  for  SHARP  with  the  newly 
created  RAOB  files.  SHARP  was  then  used  to  compute  the  predictors  (Table  1)  for  the 
thunderstorm  models.  SHARP  created  five  formatted  files,  one  for  each  forecast  duration, 
containing  4278  (62  *  69)  records.  These  data  files  were  created  in  column  form  for  easy  ingest 
to  a  spreadsheet. 

7.3  Thunderstorm  Probabilities 

The  data  files  created  by  SHARP  were  imported  into  Excel  spreadsheets.  The  “all 

thunderstorms”  models,  both  regional  and  general,  and  the  “severe  thunderstorms”  models,  both 
regional  and  general,  were  used  to  generate  probabilities  using  the  predictors  supplied  by  SHARP. 

At  this  point  in  the  study,  some  concerns  came  to  light.  There  are  seven  predictors  that  rely 
on  quantities  from  the  850mb  level.  Some  of  the  models  use  the  temperature,  dewpoint 
temperature,  wetbulb  temperature,  and  theta-E  at  the  850mb  level.  Also,  some  models  use  the  K 
Index,  SWEAT  Index,  and  the  Showalter  Index  values  that  are  derived  from  850mb  quantities. 
Some  RAOB  station  elevations  were  higher  than  the  850mb  level.  Regions  two  and  three  were 
comprised  of  stations  that  were  higher.  So  the  resulting  probabilities  were  not  valid  due  to 
missing  predictors.  These  two  regions  were  not  evaluated  any  further.  Also,  the  RAOB  station 
for  Denver  was  excluded  from  the  evaluation  of  the  region  five  models.  Further  study  may  be 
needed  to  consider  elevation  and  the  predictors  that  are  needed. 
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7.4  Results  of  Forecast  Experiments 

We  validated  the  forecasts  of  the  probability  of  all  thunderstorms  and  severe  thunderstorms 
separately  for  seven  of  the  regions.  This  was  accomplished  by  comparing  the  diagnosed 
probability  of  thunderstorm  occurrence  at  each  station  within  a  given  region  with  the 
corresponding  observation  of  thunderstorm  occurrence  at  the  forecast  valid  time.  The  validation 
measure  used  was  the  mean-squared  accuracy  of  the  diagnosed  probability: 


_  1  m 

P  =  —Y(Pm-Omf 


where: 


P  =  Probability  computed  from  the  model 
O  =  Storm  (1)/  No  Storm  (0) 


M=  Number  of  observations  (all  stations  in  a  region  for  all  forecasts). 

The  smaller  the  value  of  P,  the  better  the  accuracy  of  the  predicted  probability.  Table  17  lists  the 
results  of  the  validation  of  the  predictions  of  the  “all  thunderstorms”  diagnosis  models  against  all 
thunderstorms,  while  the  results  given  in  Table  18  are  a  product  of  the  validation  of  the  “severe 
thunderstorms”  models  against  just  the  severe  thunderstorm  occurrences  (that  is,  Om  =  0  for  non- 
severe  thunderstorms).  The  column  key  for  the  tables  is  as  follows:  column  A  -  probability 
computed  with  regional  models;  column  B  -  probability  computed  with  region  5  model  using 
region  6  data  /  region  6  model  using  region  5  data;  column  C  -  probability  computed  with 
generalized  models;  column  D  -  probability  is  zero;  column  E  -probability  is  randomly  generated; 
column  F  —probability  randomly  selected  from  probabilities  generated  by  the  regional  models; 
column  G  —probability  randomly  selected  from  probabilities  generated  by  the  generalized  models; 
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Column  H  -probability  bias  (remove  the  squared  exponent  from  the  above  equation)  based  on 
probability  computed  with  regional  models. 

In  column  A  of  the  tables,  we  list  the  validation  scores  where  the  probability  was  computed 
from  the  diagnosis  model  developed  for  the  respective  region.  The  most  notable  trend  in  these 
scores,  are  the  better  skill  shown  for  the  severe  thunderstorms  in  all  regions.  We  attribute  this  to 
the  fact  that,  in  the  data  base,  there  were  many  fewer  occurrences  of  severe  thunderstorms  than 
non-severe  thunderstorms.  Thus,  a  zero  probability  of  severe  thunderstorms  is  likely  to  be  a  good 
forecast,  and  it  is  much  easier  to  obtain  good  forecasts  without  a  skillful  technique.  This  is  shown 
to  be  true  by  an  examination  of  column  D,  in  which  a  zero  probability  was  used  in  place  of  the 
computed  probability  in  the  validation.  It  is  clear  that,  with  the  exception  of  region  8,  the  skill  of 
the  zero  probability  forecast  is  as  good  or  better  for  severe  thunderstorms  than  the  model’s  skill. 
This  is  clearly  not  true  for  the  all  thunderstorms  verification,  in  which  there  are  many  more 
occurrences  of  thunderstorms  making  a  perpetual  zero  probability  a  bad  forecast.  We  do  not  see 
a  significant  trend  of  skill  change  with  forecast  duration  in  either  table.  While  there  are  some 
regions  in  which  the  greater  forecast  durations  show  worse  scores,  we  doubt  that  the  differences 
between  these  scores  and  those  at  smaller  durations  are  significant.  Some  regions  show  better 
scores  than  others:  for  all  thunderstorms,  region  4  has  the  best  scores  while  regions  6  and  1 1  are 
virtually  tied  for  the  worst.  In  contrast,  for  severe  thunderstorms,  region  1 1  has  the  best  results 
while  regions  4  and  6  are  the  worst. 
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Table  17.  Evaluation  of  the  All  Thunderstorms  Models 
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Table  18.  Evaluation  of  the  Severe  Thunderstorms  Models 
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We  wished  to  see  how  sensitive  the  regional  predicted  thunderstorm  probability  scores  were 
to  the  model  used  to  generate  them.  In  column  B,  we  show  the  results  of  using  the  region  6  model 
on  the  region  5  forecast  data  (column  B,  region  5),  and  region  5  model  on  region  6  forecast  data 
(column  B,  region  6).  This  pair  of  regions  was  chosen  for  this  experiment  because  of  the 
significance  of  the  difference  in  skill  shown  in  column  A  for  the  two  regions  (that  is,  when  their 
respective  model  was  used).  The  fact  that  the  results  change  very  little  from  their  column  A 
counterparts  in  either  table  indicates  that  the  model  used  was  less  important  than  the  data  set  it 
was  applied  to  and  verified  against.  Region  5  has  better  scores  than  region  6  regardless  of  which 
model  was  used  to  diagnose  probability.  This  suggests  that,  at  least  for  these  two  regions,  the 
diagnosis  models  are  essentially  interchangeable. 

Another  test  of  the  sensitivity  of  the  predicted  probability  skill  to  the  diagnosis  model  used  is 
to  use  the  generalized  model  (that  is,  the  model  developed  using  the  data  for  all  regions)  in  all 
regions.  The  results  for  this  experiment  are  given  in  column  C.  The  degree  of  change  of  skill 
seems  to  vary  by  region.  Only  in  region  4  for  all  thunderstorms  does  the  skill  level  drop 
significantly  when  the  generalized  model  is  used  instead  of  the  regional  model.  In  some  regions  (7 
and  10,  possibly  8  and  11  for  all  thunderstorms;  4,5,6,10,  and  11  for  severe  thunderstorms)  the 
skill  level  actually  seems  to  improve.  In  the  remaining  regions  there  is  no  appreciable  change  in 
skill.  Overall,  this  suggests  that  the  skill  obtained  by  the  generalized  model  is  no  worse  than,  and 
possibly  even  better  than,  that  from  the  application  of  regional  models  to  their  own  regions.  Here 
again,  the  model  used  seems  to  be  less  important  than  the  data  it  is  applied  to  in  determining 
predicted  probability  skill. 

In  order  to  show  a  measure  of  relative  skill  of  a  forecast  procedure,  it  is  of  interest  to  examine 
the  skill  obtained  as  compared  with  the  skill  resulting  from  a  random  forecast.  This  comparison 
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can  be  made  by  comparing  the  scores  in  columns  A  (C)  with  the  scores  in  E  when  evaluating  the 
skill  of  the  regional  (generalized)  model.  It  is  clear  from  the  results  that  the  model  does  show  skill 
with  respect  to  a  random  forecast  of  thunderstorm  probability  in  both  non-severe  and  severe 
scenarios.  However,  this  differential  is  decreased  when  the  random  probabilities  are  chosen 
randomly  from  the  pool  of  probabilities  diagnosed  by  the  regional  (generalized)  model,  column  F 
(column  G),  rather  than  purely  randomly  assigned  (column  E).  The  fact  that  the  column  F  (G) 
scores  are  better  than  column  E  but  worse  than  column  A  (C)  suggests  that:  use  of  the  model  can 
narrow  down  the  probability  range  from  that  of  a  random  guess;  there  is  even  some  skill 
associated  with  the  location  and  time  of  the  diagnosis  as  opposed  to  randomly  choosing  a 
probability  that  is  typical  of  the  continental  US  in  August;  a  forecast  from  a  typical  probability 
range  for  a  region  and  season  (like  that  which  could  be  derived  just  from  the  thunderstorm 
occurrence  data)  is  probably  better  than  a  fully  random  guess.  In  a  few  of  the  regions,  the  skill 
from  randomly  selecting  diagnosed  probabilities  from  the  generalized  model  (column  G)  is  better 
than  from  randomly  selecting  probabilities  diagnosed  from  the  model  applied  to  its  respective 
region.  In  no  cases  is  the  opposite  true.  This  seems  to  reinforce  the  suggestion  that  the 
generalized  model  is  a  suitable  replacement  for  the  regional  model  in  essentially  any  region. 

Column  H  gives  the  bias  of  the  probabilities  diagnosed  by  the  regional  models  (that  is,  remove 
the  squared  factor  in  the  above  equation).  In  the  all-thunderstorms  validation,  the  results  show 
that  there  is  a  tendency  to  underpredict  the  probability  of  thunderstorms  in  regions  7,  8,  and  10. 
There  seems  to  be  no  tendency  to  overpredict  non-severe  thunderstorms  in  any  of  the  regions. 
Just  the  opposite  trend  is  observed  for  severe  thunderstorms.  The  tendency  to  overpredict 
occurrence  of  severe  thunderstorms  is  expected  since  their  actual  frequency  of  occurrence  is  so 
low. 
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In  summary,  the  validation  results  show  that  the  skill  of  predicting  the  probability  of 
thunderstorm  occurrence  by  the  perfect  prog  models  developed  in  this  project  is  relatively 
insensitive  to  the  model  used.  The  difference  in  skill  between  regions  is  greater  than  the 
difference  in  skill  when  two  different  models  are  applied  to  the  same  region.  The  results  suggest 
that  the  generalized  model  can  be  substituted  for  a  region’s  model  without  any  appreciable  loss  of 
skill  (and  perhaps  even  an  improvement  in  skill)  in  that  region.  The  diagnosis  models  show  skill 
with  respect  to  purely  random  probability  forecasts  and  probability  forecasts  assigned  randomly 
from  a  range  of  reasonable  probabilities.  The  tendency  of  the  regional  models  to  underpredict 
probabilities  of  all  severities  of  thunderstorms  is  regionally  dependent,  but  probably  not  model 
dependent.  The  models  don’t  seem  to  have  a  tendency  to  overpredict  except  where  the  actual 
occurrence  of  thunderstorms  is  rare. 
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